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Abstract. Within a contest there is some probability Mi(t) that contestant 
i will be the winner, given information available at time t, and Mi(t) must 
be a martingale in t. Assume continuous paths, to capture the idea that 
relevant information is acquired slowly. Provided each contestant's initial 
winning probability is at most b, one can easily calculate, without needing 
further model specification, the expectations of the random variables Nb = 
number of contestants whose winning probability ever exceeds b, and D a b = 
total number of downcrossings of the martingales over an interval [a, b]. The 
distributions of Nb and D a b do depend on further model details, and we 
study how concentrated or spread out the distributions can be. The extremal 
models for Nb correspond to two contrasting intuitively natural methods for 
determining a winner: progressively shorten a list of remaining candidates, 
or sequentially examine candidates to be declared winner or eliminated. We 
give less precise bounds on the variability of D a b- We formalize the setting of 
infinitely many contestants each with infinitesimally small chance of winning, 
in which the explicit results are more elegant. A canonical process in this 
setting is the Wright-Fisher diffusion associated with an infinite population 
of initially distinct alleles; we show how this process fits our setting and raise 
the problem of finding the distributions of Nf, and D a b for this process. 



1. Introduction 

Given a probability distribution p = (pi,i > 1) consider a collection of 
processes (Mj(t),0 < t < oo,i > 1) adapted to a filtration (F t ) and satisfying 

(i) Mi(0)=pi,i > 1; 

(ii) for each t > we have < M^t) < 1 Vz and £\ M iif) = 1; 
(hi) for each i > 1, (Mj(t),t > 0) is a continuous path martingale; 

(iv) there exists a random time T < oo a.s. such that, for some random 7, 
Mi{T) = 1 and Mj(T) = VjV l 

Call such a collection a p-feasible process, and call the Mj(-) its component mar- 
tingales. To motivate this definition, consider contestants in a contest which 
will have one winner at some random future time. Then the probability Mi(t) 
that contestant % will be the winner, given information known at time t, must be 



Date: November 12, 2012. 

2000 Mathematics Subject Classification. Primary: 60G44; Secondary: 91A60. 
Key words and phrases. Entrance boundary, fluctuations, martingale; upcrossing; Wright- 
Fisher diffusion. 

Aldous's research supported by N.S.F Grant DMS-0704159. 

1 



2 



DAVID ALDOUS AND MYKHAYLO SHKOLNIKOV 



a martingale as t increases. In this scenario all the assumptions will hold auto- 
matically except for path-continuity, which expresses the idea that information 
becomes known slowly. 

In view of the fact that continuous-path martingales have long been a central 
concept in mathematical probability, it seems curious that this particular "con- 
test" setting has apparently not previously been studied systematically. More- 
over the topic is appealing at the expository level because it can be treated at 
any technical level. In an accompanying non-technical article for undergradu- 
ates [?] we show data on probabilities (from the Intrade prediction market) for 
candidates for the 2012 Republican U.S. Presidential Nomination. The data is 
observed values of the variables Nb and D a b below, and one can examine the 
question of whether there was an unusually large number of candidates that 
year whose fortunes rose and fell substantially. In this paper, the proof in sec- 
tion [3] of distributional bounds on Nb is mostly accessible to a student taking 
a first course in continuous-time martingales, and subsequent sections slowly 
become more technically sophisticated. 

The starting point for this paper is the observation that there are certain 
random variables associated with a p-feasible process whose expectations do 
not depend on the actual joint distribution of the component martingales, and 
indeed depend very little on p. For < a < b < 1 consider 

Nb := number of % such that supMj(t) > b 

t 

D a b '■= sum over % of the number of downcrossings of Mj(-) over [a, b\. 

Straightforward uses of the optional sampling theorem (described verbally in 
[?] as gambling strategies) establish 

Lemma 1. IfmaXiPi < b then for any p-feasible process, 
E[N b ] = 1/6, E[D asb ] = (1 - b)/(b - a). 

In contrast, the distributions of Nj, and D a b will depend on the joint distri- 
butions of the component martingales, and one goal of this paper is to study 
the extremal possibilities. Here is our result for N b . 

Proposition 2. (a) If max^pj < b then there exists a p-feasible process for 
which the distribution of N b p is supported on the integers [l/b\ and [1/6] brack- 
eting its mean 1/6. 

(b) There exists a family, that is a p-feasible process for each p, such that the 
distributions of N b p satisfy 

(1.1) dist(iV?) — > Geometric(6) as m&xpi — > 0. 

i 

(c) Any possible limit distribution for as maxjpj — > has variance at most 
(1 — b)/b 2 , the variance of Geometric (b). 

Clearly the distribution in (a) is the "most concentrated" possible, and part 
(c) gives a sense in which the Geometric (b) distribution is the "most spread out" 
distribution possible. The proof will be given in section [31 The construction 
for (a) formalizes the idea that we maintain a list of candidates still under 
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consideration, and at each stage choose one candidate to be eliminated. The 
construction for (b) formalizes the idea that we examine candidates sequentially, 
deciding to declare the current candidate to be the winner or to be eliminated. 
Returning briefly to the theme that this topic is amenable to popular exposition, 
with some imagination one can relate these two alternate ideas to those used 
in season-long television shows. Shows like Survivor overtly follow the idea for 

(a) , whereas the idea for (b) would correspond to a variant of Millionaire 

in which contestants were required to try for the million dollar prize and where 
the season ends when the prize is won. 

We give an analysis of downcrossings D a j, in section HJ though with less 
precise results. The construction that gave the Geometric limit distribution 
for Nb in (11. ip also gives a Geometric limit distribution for D a b (Proposition 
[6]). We conjecture this is the maximum- variance possible limit, but can give 
only a weaker bound in Proposition [7J As for minimum- variance constructions, 
Proposition [8] shows one can construct feasible processes for which, in the limit 
as b with a/b bounded away from 1, the variance of D a b is bounded by 
a constant depending only on a/b. The case a/b ~ 1 remains mysterious, but 
prompts novel open problems about negative correlations for Brownian local 
times - see section [7J 

1.1. O-feasible processes. As a second goal of this paper, it seems intu- 
itively clear that the concept of p-feasible process can be taken to the limit 
as maxjpi — > 0, to represent the idea of starting with an infinite number of con- 
testants each with only infinitesimal chance of winning. Informally, we define 
a O-feasible process as a process with the properties: 

(i) for each to > 0, conditional on Mj(t ) = Pi,i > 1, the process (Mj(t + 
t),0<i<oo,z>l)isa p-feasible process; 

(ii) supj Mj(t) — > a.s. as t 1 0. 

There is some subtlety in devising a precise definition, which we will give in sec- 
tion Once this is done we can deduce results for general O-feasible processes 
as limits of results for p-feasible processes under the regime max^pj — > 0, and 
also we can construct specific O-feasible processes by splicing together specific 
p-feasible processes under the same regime (Proposition [TT]) . 

By eliminating any dependence on p, results often become cleaner for 0- 
feasible processes. For instance Proposition [2] becomes 

Corollary 3. (a) There exists a O-feasible process such that, for each < b < 1, 
the distribution Nb is supported on the integers [l/b\ and \l/b] bracketing its 
mean 1/b. 

(b) Given < b < 1, there exists a O-feasible process such that, for each 
bo < b < 1, Nb has Geometric (b) distribution. 

(c) Moreover for any O-feasible process and any < b < 1 the variance of Nb 
is at most (1 — b)/b 2 , the variance of Geo metric (b). 

Setting aside the "extremal" questions we have discussed so far, another mo- 
tivation for considering the class of O-feasible processes is that there is one 
particular such process which we regard intuitively as the "canonical" choice, 
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and this is the 0- Wright-Fisher process discussed in section El This connection 
between (a corner of) the large literature on processes inspired by population 
genetics and our game contest setting seems not to have been developed before. 
In particular, questions about the fluctuation behavior of the 0- Wright-Fisher 
process - the distributions of Nb and D a b - arise more naturally in the contest 
setting, though it seems hard to get quantitative estimates of these distribu- 
tions. 

2. Preliminary observations 

2.1. The downcrossing formula. In our setting of a continuous-path mar- 
tingale M(-) ultimately stopped at or 1, recall the "fair game formula" 

(2.1) P(M(t) hits b before a |M(0) = x) = fff, 0<a<a;<6<l 

from which one can readily derive the well known formula for the expectation 
of the number D of downcrossings of M (■) over [a, b\. for < a < b < 1, 

(2.2) E[D\M(0) = x] = if < x < & 

(2.3) = b -^ i£b<x<l. 

Moreover, starting from b there is a modified Geometric distribution for D: 



F(D = 0|M(0) = b) - b ~ a 



l-a 

d-1 



(2.4) F(D = d\M(0) = b) = g (fgEf) " (l - fgfg) , d > 1. 

2.2. The multivariate Wright-Fisher diffusion. Textbooks introducing dis- 
crete time martingales often use as an example (e.g. [?] Example 10.2.6) the 
discrete-time Wright-Fisher model for genetic drift of a single allele. Note that 
throughout what follows, we consider only the case of no mutation and no se- 
lection. It is classical that the infinite population limit of the /c-allele model is 
the multivariate Wright-Fisher diffusion on the k — 1-dimensional simplex, that 
is with generator 

(2-5) IJ^-s.) . 

i,j=i ■> 

Each component is a martingale, the one-dimensional diffusion on [0, 1] with 
drift rate zero and variance rate x(l — x). There has been extensive work since 
the 1970s on the infinitely-many-alleles case, but this has focussed on the case of 
positive mutation rates to novel alleles, in which case the martingale property no 
longer holds. In our setting (no mutation and no selection) it is straightforward 
to show directly (see section [6j) that for any p = > 1) with countable 

support there exists what we will call the p- Wright-Fisher process, the infinite- 
dimensional diffusion with generator analogous to ( 12. 5 j) starting from state p, 
and that this is a p-feasible process. So we know that p-feasible processes do 
actually exist, and these p- Wright-Fisher processes will be useful ingredients in 
later constructions. (When p has finite support we could use instead Brownian 
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motion on the finite-dimensional simplex, whose components are killed at and 
1, but this does not extend so readily to the infinite-dimensional setting). 

It is convenient to adopt from genetics the phrase fixation time for the time 
T at which the winner is determined. 

2.3. Constructions using Wright-Fisher. In a Wright-Fisher diffusion we 
have Y2i Mi (t) = 1 , but trivially we can consider a rescaled Wright-Fisher 
diffusion for which X^^W i s a prescribed constant. 

Our constructions of feasible processes typically proceed in stages. Within 
a stage we may declare that some component martingales are "frozen" (held 
constant) and the others evolve as a rescaled Wright-Fisher process. In par- 
ticular if only two component martingales are unfrozen, say at the start S of 
the stage we have Mi(S) = Xi and Mj(S) = Xj, then during the stage we have 
a "reflection coupling" with Mj(t) + Mj(t) = + Xj, and we can choose to 
continue the stage until the processes reach Xj + Xj and 0, or we can choose to 
stop earlier. 

An alternative construction method is to select one component martingale 
Mi(S) at the start of the stage, let (Mj(-), 1 — Mj(-)) evolve as the two-allele 
Wright-Fisher process during the stage, and set Mj(-) = ^^(s) x (1 ~~ ^i('))- 
We describe this construction by saying that the processes (Mj(-),j ^ i) are 
tied. 

Both constructions clearly give continuous-path martingale components. 

The results in sections |3] and H] are based on concrete calculations and con- 
structions, though in applying them to O-feasible processes we "look ahead" 
and quote results from later (Propositions [IT] and IT2]) which are designed for 
this purpose, formalizing the intuitive description from section [1J] so as to allow 
results to be easily interchanged between p-feasible and O-feasible processes. 

3. Proofs of distributional bounds on N b 

Proof of Propositioned). Fix b. Run a Wright-Fisher process started at p 
until some Mj(-) reaches b. Freeze that % and run the remaining processes as 
rescaled Wright-Fisher until some other Mj(-) reaches b. Freeze that j and 
continue. After a finite number of such stages we must reach a state where all 
component martingales except one are frozen at b or at 0, and the remaining 
one is in [0,6]. Because X^-^W = 1 the number frozen at b must be \l/b\ 
and the remaining martingale must be at 1 — |_ 1 / & J . Finally, unfreeze and run 
from this configuration to fixation as Wright-Fisher. Clearly takes only the 
values [l/b\ and [1/6] . □ 

Proof of Corollary [3]f a). This construction is similar to that above, but is closer 
to our earlier informal description "maintain a list of candidates still under 
consideration, and at each stage choose one candidate to be eliminated" . 

For each integer m > 2, we will define a stage which starts with m component 
martingales at 1/m, and ends with m — 1 of these martingales at l/(m — 1) and 
the other frozen at 0. To construct this stage, run as Wright-Fisher until some 
Mj(-) reaches l/(m — 1). Freeze that i and run the remaining martingales as 
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rescaled Wright-Fisher until some other Mj-(-) reaches l/(m — 1). Freeze that 
j and continue. Eventually we must reach a state where m — 1 martingales 
are frozen at l/(m — 1) and the remaining process is 0. This stage takes some 
random time r m with finite expectation; without needing to calculate it, we can 
simply rescale time so that E[r m ] = 2~ m . 

Intuitively, we simply put these stages together, to obtain a O-feasible process 
in which, for each M > 1, at time ^2 m>M T m there are exactly M martingales 
at 1/M. Proposition ITT1 formalizes this construction. This process satisfies the 
assertion of the Corollary for each b = 1/M, and then for general b because Nj, 
is monotone in b. □ 

Remark 1. Let us call a O-feasible process with the property above, that for 
each M > 2 there is a time at which there are exactly M component martingales 
each at value 1/M, a Survivor process. We placed the proof here to illustrate 
the technical issue arising in making precise the construction of such a O-feasible 
process, which is to arrange consistent labeling of each component martingale 
across the different stages. The point is that the one out of the M that does 
not reach 1 / (M — 1) is a uniform random pick, so we cannot just label them as 
1, . . . ,M for each M. 

Proof of Proposition^b). Fix b. Write p in ranked order pi > pi > . . ., and 
write J for the first term (if any) such that pj + i/(l — p\ — . . . — pj) > b. 

We use the "tied" construction from the start of this section. Run (Mi(-), 1 — 
Mi(-)) as Wright-Fisher started from (j>i, 1 — pi) and stopped at Si := min{t : 
Mity) = or 1}, and for i > 2 set 

M(t) = T ^ r (l-M 1 (t)), 0<t<Sx. 

So Mj(-) is a martingale on this time interval. Note that if J ^ 1 then no Mj(-) 
can reach b before time Si, for i > 2. 

If Mi(Si) = 1 the process stops. If Mi (Si) = then for i > 2 we have 
Mi(Si) = pi/(l - pi). For t > Si run (M 2 (-),l - M 2 (-)) as Wright-Fisher 
started from (jz^-, 1 ~ 1 ? j~ P2 ) and stopped at S 2 := min{t : M 2 (t) = or 1}, and 
for i > 3 set 

= T^( 1 - M ^))' S ^ ^ S *- 

If J ^ 2 then no Mj(-) can reach b before time S 2 , for i > 3. 

Continue in this way to define processes (Mj(t), Sj-i <t< Sj) for 1 < j < 
J, or until some Mj(-) reaches 1 and the whole process stops. If the process 
has not stopped by time Sj, continue in an arbitrary manner, which makes the 
resulting process p-feasible. Note that, if Mj(-) reaches b, then with probability 
exactly 1/6 it will reach 1, and that with probability 1 — Y2j<jPj ^ e process 
has not stopped by time Sj. 

Write = number of martingales j < J that reach b. We can now apply 
Lemma H] below to Z — N^ J \ and deduce that < Z' = Geometric(fe) 
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with Z' constructed in Lemma HI Then 

nN b ^Z') < E[\N b -Z'\ 



< E[Z' - N b {J) ] + E[N b - iV, (J)l 



b 



3>J 



Finally, as p varies we have 



if max j»j — > then Pj ~~ ^ 

j>J(p) 

establishing the limit result (II. lft . □ 

Lemma 4. Given < b < 1 and probabilities qi,l < i < J define a counting 
process by: for each i, given not yet terminated, 

with probability qib, increment count by 1 and terminate; 

with probability q^l — b), increment count by 1 and continue; 

with probability 1 — q i} continue. 
Let Z be the value of the counting process after step J or at time T ( the termina- 
tion time, if any), whichever occurs first. Then there exists Z' = Geometric(fr) 
such that Z < Z' . 

Proof. Augment the process by setting = 1, i > J and follow the algorithm 
for all i > 1. The process must now terminate at some a.s. finite time T", 
at which time the value Z' of the counting process has exactly Geometric (6) 
distribution. □ 

Proof of Proposition^c). Fix b and for k > 1 let Sk < oo be the first time at 
which k distinct component martingales have reached b. If iV& > k, then at 
time Sk one martingale takes value b, the other k — 1 that previously reached b 
take some values Z±, . . . , Z^-i, and the remaining martingales take some values 
Mj(Sk) < b. The chance that such a remaining martingale subsequently reaches 
b equals Mj(Sk)/b, and so, on {7V& > k}, 

/ k-l 

(3.1) E[N b - k\F Sk ] = b-^MjiSu) = b- 1 1 - 




So 



E[{N b - k) + ] < l=H F(N b > k) 



and summing over k > 1 gives 



E 

Finally, 



N b (N b -l) 



var(iV 6 ) = 2E[ Nb(N 2 b ~ 1) ] + E[N b ] - (EN b ) 2 < 



fe 2 



□ 
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For later use (section [5]) note that to have equality in the final display above 
we need equality in (13. lj) . implying each Zj = 0, that is the martingale compo- 
nents that previously reached b have all reached zero. We deduce 

Corollary 5. If, for a p-feasible process, Nf, has Geometric(b) distribution, 
then there is no time at which more than one component martingale is in [b, 1] . 

Proof of Corollary^fc). This follows from Proposition [2(c) and the definition 
(section \5§ of O-feasible process via embedded p-feasible processes. □ 

Proof of Corollary^b). Given b , consider the vector p of Geometric probabil- 
ities with 

(3.2) Pi = 6o(l - b y-\i > 1. 

The construction in the proof of Proposition [2(b) and its analysis show that 
for this p-feasible process and any b > 6 we have J = oo and that iVj, has 
Geometric (b) distribution. So it is enough to show that there exists a O-feasible 
process and a stopping time at which the values of the component martingales 
are p. But Proposition [T2l shows this is true for every p. □ 

4. Distributional bounds on downcrossings 
4.1. The large spread setting. 

Proposition 6. Given bo > 0, there exists a O-feasible process such that D a t,+ 1 
has Geometric (jEr) distribution, for each b < b < 1 and < a < b. 

The corresponding result (cf. Proposition 12(b)) holds for p-feasible processes 
in the limit as maxj^j — > 0. 

Proof. As in the proof of Corollary [3(b), we may start with the Geometric(fro) 
distribution p at (13. 2p and use the construction in the proof of Proposition [2(b). 
Every time a martingale component reaches b, the other components must be 
at positions 

(1-6) 6q(1 - fto)*" 1 ,* > 1. 
Similarly, each time the component completes a downcrossing of [a, b] the other 
components must be at positions 

(1-a) & (l-6orV>l- 
The event that there are no further downcrossings is the event that, after the 
next time some component reaches b, it then reaches 1 before a, and this has 
probability (b — a)/(l — a) by (12. ip . So 

¥(D ab = i\D ab > z) = (6 - a)/{\ - a), i > 1. 
By the same argument P(-D a & = 0) = (b — a)/(l — a). □ 

The variance of the Geometricf^ 11 ^) distribution can be written as 
(4-1) (HY + H- 

It is natural to guess, analogous to Corollary [3(c), that this is an upper bound 
on the variance of D a b in any O-feasible process. 
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Conjecture 1. For any O-feasible process, 

var(^)<0 2 + ^. 

The following result establishes a weaker bound. One can check that in the 
a j" b limit this bound is first order asymptotic to (^z^) 2 , which coincides with 
the first order asymptotics in (14.11) . 

Proposition 7. For any O-feasible process and any < a < b < 1, 

, s /7i-& (i-^) 2 v /2 1 / 2 ^ 2 (i-b) 2 

var(Bot) < ^_ + 2 L_L + „ j + „v» j _ L_J_ 

where fi := min((2 — b)/b 2 , 1/a 2 ). 

Proof. Fix < a < b < 1 and consider an arbitrary O-feasible process. Call a 
particular component martingale at a particular time active if it is potentially 
part of a downcrossing of [a, b\. That is, the martingale is initially inactive; it 
becomes active if and when it first reaches b; it becomes inactive if and when 
it next reaches a; and so on. So a martingale at x is always active if x > b, is 
always inactive if x < a, but may be active or inactive if a < x < b. 

Given that a particular martingale is currently at x, the mean number of 
future downcrossing completions equals, by (12.21 12.31) 

— if inactive; (i-g)& jf ac ^j ve _ 

Analogously to the proof of Proposition 12(c), consider the time Sk at which the 
fc'th downcrossing has been completed. On {Sk < oo}, 

(b-a)E[D ab -k\T Sk ] = (l-b) M i( S k) + b X) Q--Mi(S k )) 

i inactive i active 

and because ^Mi(-) = 1 this becomes 

b+(b-a)E[D ab -k\F Sk }= Yl M i(Sk)+ Yl k 

i inactive i active 

The number of active martingales at time is at most := number of 
martingales that reached b before time Sk- So the right side cannot be larger 
than the value taken when min(A r ^ fc ' ) , 1/a) active martingales take values just 

above a and the remaining value of 1 — amm(N^ k \ 1/a) is distributed among 
the inactive martingales. This gives the upper bound 

b+(b-a)E[D ab -k\J r Sk ) < l-amm(N b {k \ l/a)+bmm(N { b k \ 1/a) on {S k < oo}. 

The event {Sk < oo} is the event {D ab > k}, so taking expectations and 
rearranging gives 

E[(D ab - k)+] < JeJ nD ab >k)+ E[mm(N { b k \ 1/a) l {Dab > k} }- 
Because N^ k) < N b , summing over all k > 1 gives 
(4.2) \E[D ab (D ab - 1)] < JEh nDab] + E[min(iV 6 , 1/a) D ab }. 
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Apply the Cauchy-Schwarz inequality to the second summand on the right side 
and use E[D ab ] = to conclude 

(4.3) E[Dl b ] < ^ + 2 |i^ + 2E[min(iV b ,l/a) 2 ] 1/2 EK] 1/2 . 

Next, for positive constants C\,C 2 we have the elementary implication 

if < a < d + 2 C 2 ^/a then y/a~ < sJd + C 2 + C 2 . 
In our situation, this gives 

1 /2 

\/nm < (^ + 2|l^|J + E[min(iV b ,l/a) 2 ]) +E[min(iV fe , 1/a) 2 ] 1,2 . 

Using first Jensen's inequality and then the result ( Corollary [3^c)) that var(A r f ) ) < 
(1 — b)/b 2 , we see 

E[min(iV b , 1/a) 2 } < min(E[A^ b 2 ], 1/a 2 ) < min((2 - b)/b 2 , 1/a 2 ) 

from which the inequality in the proposition readily follows. □ 

4.2. The small spread setting. Proposition [2]^a) showed that the spread 
of Nf, could be very small. To see that the case of D ab must be somewhat 
different, recall that for a martingale component which reaches 6, its number of 
downcrossings has the modified Geometric distribution (12. 4p with mean 6(1 — 
6)/(6 — a). So if we fix b and consider limits in distribution as a t b, we must 
obtain a limit of the form 

, N b 

o-a \ - ( 

where each £j has Exponential(l) distribution. And although there will be some 
complicated dependence between (Nb, £i, £2, • • •) it is clear that the limit cannot 
be a constant, and therefore in any p-feasible process the variance of D a b as 
a t b must grow at least as order {b — a)~ 2 . We will not consider that case 
further here (but see an open problem in section [7j), instead turning to the case 
where a/b is bounded away from 1. Here, in a O-feasible process, E[D a b] grows 
as order 1/6 as b ^ 0. The next result shows there exist O-feasible processes for 
which the variance of D a b remains 0(1). 

The idea behind the construction is to exploit reflection coupling. For in- 
stance, starting with 2m components at 6, a reflection coupling moves the 
process to a configuration with m components at a and m at 26 — a while 
adding m downcrossings; one can extend this kind of construction to make the 
process pass through a deterministic sequence of configurations while adding a 
deterministic number of downcrossings. 

Proposition 8. For each < a < 1 there exists a constant C(a) < 00 such 
that: given < a\. < 6& — > with a^/bu —> a, there exist O-feasible processes 
such that 



(4.4) 



limsup var(D afcjfcfc ) < C(a). 
k 
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Proof. Fix k, set (a, b) = (a^, &&) and with an abuse of notation write a = a^/bk- 
By Proposition [T2] we may assume we have a po-feasible process, where po has 
finite support and its components are in (0,6). 

The proof makes repeated use of the following kind of construction. Specify 
an interval [ao, bo], freeze martingale components initially outside that interval, 
run the other components as a rescaled Wright-Fisher process and freeze them 
upon reaching ao or bo (typically there will be one component ending within 
(ao, bo)). Note this construction has a particular "deterministic" property, that 
in the final random configuration (Mi(t),i > 1) the ranked (decreasing ordered) 
values rank(Mj(t),i > 1) are non-random, determined by the (ranked) initial 
values. This holds because ^¥,(4) = 1. 

The central idea of the proof is the following lemma. 

Lemma 9. Write K = K(a) = 6|_y~J — 1. There exists a po-feasible process 
which reaches a configuration pi with at most one martingale with value in 
(6, 1] and at most K martingales taking values in (0,6] , having accomplished a 
deterministic number of downcrossings before that time. 

Proof. We construct the process in stages. At the start of each stage, we con- 
sider the first case in the list below which holds, and do the construction speci- 
fied below for that case. If no case holds then stop; note the property "at most 
K martingales taking values in (0, 6]" will then be satisfied. 

Case 1. There are at least 1 + Lyz^J martingales at b; 

Case 2. There are at least 2 Lt^J + 1 active martingales in (a, b); 

Case 3. There are at least 2|_j-z^J + 1 inactive martingales in (a, 6); 

Case 4- There are at least Ltz^J martingales in (0, a]. 

Construction in case 1. We let the martingales at b evolve according to the 
appropriately rescaled Wright-Fisher diffusion, while freezing all other martin- 
gales, and then freeze the evolving martingales that reach level a. At least [rz^J 
martingales will reach level a, and exactly one will be above b. Once all martin- 
gales are frozen, we let those at a evolve as the rescaled Wright-Fisher diffusion 
until they reach or b. Finally, if initially there were martingales above b, then 
we let all the martingales above b evolve as the appropriate Wright-Fisher diffu- 
sion and freeze those that reach b. This procedure adds a deterministic number 
of downcrossings (all in the first step), and leaves exactly one martingale above 
b. 

Construction in cases 2 and 3. In case 2 we let the active martingales in (a, b) 
evolve until they either reach a or b and freeze them at that time. All except 
one of these martingales reach a or b, so either at least Liz^J + 1 martingales 
end at b, or at least Lxz^J martingales end at a, adding a deterministic number 
of downcrossings. So the ending configuration will fit case 1 or case 4. In 
case 3 we do the same but with the inactive martingales instead; no additional 
downcrossings are added. 

Construction in case 4- We let the martingales in (0, a] evolve until they 
reach or b and freeze them at that time. At least one of them must reach 0, 
and no additional downcrossings are added. 
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The sequence of stages must end, because: in each case 4 stage at least 
one martingale is stopped at 0, and each case 1 stage creates at least one 
downcrossing, so there can be only a finite number of such stages; and each 
case 2 or 3 stage is followed by such a stage. 

Moreover each stage is "deterministic" , in the previous sense that the ranked 
configuration at the end of the stage is determined by the ranked configuration 
at the start, and therefore the ranked configuration pi at the termination of 
the entire construction is non-random, determined by the initial configuration 
Po- This implies the total number of downcrossings is deterministic, because 
the number within each stage is determined by that stage's starting config- 
uration. As already mentioned, pi has the property "at most K martingales 
taking values in (0, b]" by the termination condition. The number of martingale 
components taking values in (b, 1] is at most 1, because each case 1 stage ends 
that way and the other cases do not allow components to exceed level b. □ 

In view of Lemma HJ to complete the proof of the proposition it suffices to 
show (I4.4p for some pi-feasible process with pi as in Lemma |9j In fact we can 
take an arbitrary such process. The point is that (as noted earlier) the number 
of downcrossings D aa has a representation of the form 

N* 

D a b = Gi 
i=l 

where N* is the number of martingale components that hit b, and each Gi 
has the modified Geometric distribution (12. 4p . Without any knowledge of the 
dependence between (N*, Gi,G 2 , ■ ■ ■), the fact N* < K + 1 implies 

var(£> a6 ) < E[D 2 J < (K + 1) 2 E[G?]. 

It is easy to check that E[G 2 ] is bounded in the limit as b — > with a/b — > a < 1, 
and (H3D follows. □ 

5. O-FEASIBLE PROCESSES 

In section 15.11 we will give one formalization of the notion of a O-feasible 
process introduced informally in section 11.11 an d in sections 15.21 and 15.31 we 
give results allowing one to relate constructions and properties of O-feasible 
processes to those of p-feasible processes. 

There are several possible choices for the level of generality we might adopt. 
The "canonical" example of the 0- Wright-Fisher process, and the "Survivor" 
process featuring in Corollary El^a), have the property that at times t > the 
process has only finitely many non-zero components, so we could make this 
a requirement. Instead we will allow a countable number of non-zero compo- 
nents - "because we can" . In the other direction, consider the construction of 
reflecting Brownian motion R(t) from standard Brownian motion W(t) as 

R(t) := W(t) -mmW(s) 

s<t 
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and run the process until R(-) hits 1. Within our setting, interpret this as 
saying that at time t there is one contestant with chance R(t) of winning, the 
remaining chance 1 — R(t) being split amongst an infinite number of unidentified 
contestants each with only infinitesimal chance of winning. Informally this is a 
O-feasible process such that 

(5.1) Nf, has Geometric (b) distribution for every < b < 1, 

strengthening the assertion of Corollary E](b), but it does not fit our set-up 
which will require the unit mass to be split as a random discrete distribution 
at times t > 0. In fact Corollary \5\ implies that, within our formalization, no 
O-feasible process can have property (15. ip . One could choose a more general 
set-up which allows such "dust" , as in the literature [?] cited below, but we are 
not doing so. 

The existing classes of processes in the literature with somewhat similar qual- 
itative behavior - in the theory of stochastic fragmentation and coagulation pro- 
cesses [?] which studies partitions of unit mass into clusters, or in population 
genetics inspired processes associated with Kingman's coalescent, are (to our 
knowledge) explicitly Markovian, in which context the question becomes deter- 
mining the entrance boundary of a specific Markov process [?, ?]. Our setting 
differs in that we wish to continue making only the "martingale" assumptions 
(ii,iii,iv) at the start of the Introduction, and we are seeking to define a class 
of processes. 

The following observation shows that the most naive formalization does not 
work. 

Lemma 10. Let I be countable, There does not exist any process (Mj(i),0 < 
t < oo, i G 7) adapted to a filtration (Ft) and satisfying 

(i) for each t > we have < M^t) < 1 Vi and £\ M^t) = 1; 

(ii) for each i, (Mj(i),i > 0) is a martingale; 
(Hi) supj Mj(t) — > a.s. as 1 10. 

Proof. The martingale property implies E[Mj(£)] is constant in t. But by (i) 
and (iii) we have 



5.1. A formalization. The issue, indicated by Lemma [TU] above and the par- 
ticular Survivor example in Remark [H is to find a formalization which pre- 
serves the identity of martingale components as t varies. The often used device 
of simply ranking (decreasing-ordering) components at each time t does not 
work. Our formalization combines ranking and a point process representation. 
This is admittedly somewhat ad hoc; a different but equivalent formalization is 
mentioned in Remark [2J 

A probability distribution p with pi > p 2 > p-$ > . . . is called ranked] write 
V for the space of ranked probability distributions. For a general discrete 
distribution q = (qj,j G J) write rank(q) for its decreasing ordering, where 



limE[M;(t)] = 0. 




□ 
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zero entries are omitted. More generally, for a collection (Aj,j G J) of objects 
with the same index set as (qj,j G J), write rank(A,-, j G J||q) for the collection 
re-ordered so that q is ranked (this is not completely specified if the values qj 
are not distinct, but the arbitrariness does not matter for our purposes). 

Write Co for the space of continuous functions / : [0, oo) — > [0, 1] with /(0) = 
0. Consider a random point process on Co- That is, a realization of the process 
is (informally) an unordered countable set {/«(•)} °f functions or (formally) the 
counting measure associated with that set. We will use the former notation, 
which is more intuitive. We define a O-feasible process to be a random point 
process {M Q (-)} on C such that 

< M a {t) < 1; ^ M a (t) = 1, < t < oo 

a 

m&xM a (t) — > a.s. as t j. 

a 

and with the following property. For each to > and each ranked p, 

Conditional on rank(M a (to)) = p and on ^(to), the ranked process 

(5.2) rank(M a (t + -))\\{M a (to)}) is p-feasible. 

In words, given t we simply label component martingales as 1, 2, 3, . . . in de- 
creasing order of their values at to, and we can use this labeling over t < t < oo 
to define a process (Mi (to + t), t > 0, i > 1) which we require to be a p-feasible 
process, where p is the ranked ordering of (M a (to)). For Tt we take the natural 
filtration, generated by the restriction of the point process to (0,t]. 

By standard arguments, property (15. 2 p extends to any stopping time S with 
< S < oo: 

Conditional on rank(M Q (5')) = p and on J-'(S), the ranked process 

(5.3) rank(M a (S + -))\\{M a (S)}) is p-feasible. 

In our initial definition of a p-feasible process we assumed the initial config- 
uration p was deterministic. Now define a ©-feasible process to be a mixture 
over p of p-feasible processes; in other words, a process (Mi(t),i > l,t > 0) 
which, conditional on (Mj(0), i > 1) = (pi, i > 1), is a p-feasible process. So the 
ranked process rank(M a (S + -))\\{M a (S)}) in (I5.3p . considered unconditionally, 
is a ©-feasible process, and we describe the relationship (15. 3 p by saying this ©- 
feasible process is embedded into the O-feasible process via the stopping time S. 
Similarly, any stopping time within a ©-feasible process specifies an embedded 
©-feasible process. 

Remark 2. An essentially equivalent formalization would be to assign random 
Z7[0, 1] labels U a to component martingales, so the state of the process at t is 
described via the pairs (U a , M a (t)) for which M a (t) > 0, and this can in turn be 
described via the probability measure M a (t)5u a or its distribution function. 
We will use this "random labels" idea in an argument below. 
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5.2. A general construction of O-feasible processes. Given a O-feasible 
process and stopping times S k I a.s., the associated embedded ©-feasible 
processes are embedded within each other, and their initial values {M^ k \i > 
1) satisfy maxj — > a.s.. The following result formalizes the converse 
idea: one can construct a O-feasible process from a sequence of p-feasible or 
more generally ©-feasible processes embedded into each other, via Kolmogorov 
consistency. 

Proposition 11. Suppose that (fJ, k ,k > 1) are probability measures on V and 
that for each k there are families (M k (t), i > 1, < t < oo) such that 

(i) (M k (0),i > 1) has distribution fi^- 

(ii) Conditional on (M k (0),i > 1) = p, the process M k = (M k (t),0 < t < 
oo,i > 1) is p-feasible. 

(Hi) For k > 2 there is a stopping time Tk for M fc such that tk ■= E[Tfc] < oo 
and rank(Mf (7*.), i > 1) has distribution Hk-x- 

( W ) Y.k l k < °°- 

(v) Mf(0) as k oo. 

Then there exists a O-feasible process {M a (-)} which is consistent with the fam- 
ilies above, in the following sense. There exist stopping times Sk such that for 
each k > 1 

= tj, Sk — Sk+i = Tk+i 

j>k 

and the embedded process Tank(M a (Sk + -))\\{M a (Sk)}) is distributed as M k (-). 

Proof. By conditions (i)-(iii), for each k > 2 we can represent the process M^'" 1 
as the process M. k (Tk + ■); more precisely, we can couple the two processes such 
that 

(5.4) M*-\t) = rank(Mf (T k + t)\\{M k (T k ) , i > 1)). 

Then by the Kolmogorov consistency theorem we can assume this representa- 
tion holds simultaneously for all k. We now attach labels a to the component 
martingales by the following inductive scheme. For k — 1, to each of the 
indices i designating a component martingale M}{-) we associate an indepen- 
dent Uniform(0, 1) label. For k = 2, a component martingale Mf{-) might be 
zero or non-zero at T 2 . If non-zero then we copy the label already associated 
within M 1 (-) via the coupling f!5.4p . If zero the we create a new independent 
Uniform (0, 1) label. 

Continue for each k this scheme of copying or creating labels. For each 
label a, the sample path of that martingale component in the process M k+1 is 
obtained from the sample path in M k by inserting an extra initial segment. By 
(iv) the path converges as k — )■ oo to a function M a (t), < t < oo, and by (v) 
we must have M a (0) = 0. The remaining properties are straightforward. □ 

5.3. All p-feasible processes embed. Proposition [TT] enables construction 
of specific O-feasible processes. The following result implies that any p-feasible 
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process can be embedded into some O-feasible process - simply splice the 0- 
feasible process in the proposition to the given p-feasible process at time S. 
We already used this fact in the proofs of Corollary E^b) and Proposition |SJ 

Proposition 12. Given any ranked p, there exists a O-feasible process {M a (-)} 
and a stopping time S such that rank( {M a (S)} ) = p. 

For the proof it is convenient to use Brownian-type process instead of Wright- 
Fisher. Write 

Q(t) := ^(M,(t)) 2 . 

i 

We will use constructions with the property 

At each time < t < S, at least one component 

(5.5) martingale Mi(t) is evolving as Brownian motion 

for a specified stopping time S. That is, our constructions can be written as 

dMiit) = o-i(t)dWi(t) 

for (dependent) standard Brownian motions Wi(t), and we require that some 
o~i{t) equals 1. In general Q{t) — J J2i °f ( s ) ds is a martingale, so the advantage 
of property (15. 5p is that Q(t) — t is a submartingale, implying 

Lemma 13. Let (Mj(t)) be a p-feasible process satisfying / t5.5j) for a stopping 
time S. Then E[S] < E[Q(S)] - Q(0). 

A simple construction satisfying (15. 5p is the Brownian reflection coupling of 
two component martingales. That is, on < t < S we freeze components other 
than i,j, and set 

Mi(t) - Mi(0) = Wi(t) 7 Mjit) - M,(0) = -Wi(t). 

Lemma 14. Let Iq be countable, and Ii and I2 be finite, index sets. Let (pi,i G 
J0U/1) and (qi,i G J0U/2) be probability distributions which coincide on Iq and 
satisfy maxjg^pj < mm ie j 2 qi. Then there exists a p-feasible process (M a (-)} 
satisfying Ii5.5\) such that for some stopping time S we have rank( {M a (S)} ) = 
rank(q). 

Proof. Freeze permanently the component martingales with i G Iq. Pick two 
arbitrary indices i', i* in I\ and run the Brownian reflection coupling on these 
two components (t) , (t) until one component hits zero or min ig / 2 q^. In 
the latter case, freeze that component permanently and delete its index from 
Ji and delete argminj e / 2 qi from I2. In the former case, only delete the index 
from I\. The total number (originally |Ji| + I/2I) of unfrozen components is now 
decreased by at least 1. Continue inductively, picking two components from I\ 
at each stage. Eventually all components are frozen and the ranked state is 
rank(q). □ 
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Proof of Proposition d2J Define p° = p and for k > 1 construct p k from p by 

(i) retaining entries Pi with p$ < 2~ k ; 

(ii) replacing other p^ by 2 J W copies of 2 _J Wp i , where > 1 is the smallest 
integer such that 2~ j ^pi < 2~ k . 

Each pair (p fc ,p fc_1 ) satisfies the hypothesis of Lemma [HI So for each k, 
writting = 5 p k and writing M fc and T& for the p fc -feasible process and the 
stopping time given by Lemma [T^ we see that hypotheses (i)-(iii) of Proposition 
[TTJare satisfied. Moreover by Lemma [T3l we have E[T)t] < qu-i — Qk for qu '■= 
Y2i(Pi) 2 ' implying that hypotheses (i)-(iii) are also satisfied. The conclusion of 
Proposition [TT1 now establishes Proposition [T2"l □ 

6. The 0- Wright-Fisher process 

Write A for the (unranked) infinite simplex {(pi, 1 < i < oo) : Pi > 
0, ^2 i Pi = 1}. As mentioned in section I2.2[ for each p G A there exists the p- 
Wright-Fisher process, a process with sample paths in C([0, oo), A) and initial 
state p, which is the infinite-dimensional diffusion with generator analogous to 
(12. 5p starting from state p, and that this is a p-feasible process. This has a 
straightforward construction: given p G A, set p n = (pi, . . . ,p n -i, 2~2 m >nP m )' 
so the p n -process exists as a finite-dimensional diffusion. But there is a natural 
coupling between the p n_1 - and the p n -processes in which the first n — 2 co- 
ordinate processes coincide, and appealing to Kolmogorov consistency for the 
infinite sequences of processes we immediately obtain the p-process. 

Intuitively, we want to think of the 0- Wright-Fisher process as a suitable 
limit of the (1/n, 1/n, . . . , l/n)-Wright-Fisher processes as n — > oo. But in 
fact the limit in distribution, in the compactified space A = {(pi,l < i < 
oo) : Pi > 0, YliPi — 1}' i s t ne process which is identically (0,0,0,...). The 
foundational 1981 paper of Ethier and Kurtz [?] shows that a non-trivial limit 
X(£) = (Xi(t),i > 1) starting from (0,0,0,...) does exist if we work in the 
ranked infinite simplex V; more precisely the limit process has sample paths in 
C([0, oo), V) for the compactifed ranked simplex V, but for t > takes values 
in V. 

That process is in some senses the process we want, but that formalization 
does not suffice for our purposes because it does not preserve the identity of 
components as t varies. That is, we want the O-feasible process {M a (t)} whose 
components are martingales and for which 

(6.1) X(t) = rank({M a (t)}) with a separate ranking for each t. 

The component processes Xi(-) are not martingales and we cannot define quan- 
tities like Nb and D a b in terms of X. Note that by Lemma [10] we cannot 
represent X(t) as rank(M(£)) for any process in C([0,oo), A) with martingale 
components. 

Fortunately we can fit the 0- Wright-Fisher process into our abstract set-up 
by combining the existence of the process X(£) with our Proposition (TTJ Take 
times Sfc i and let /i& be the distribution of X(sfc). Then there is a de- 
feasible Wright-Fisher process M fc with initial distribution /i^, and existence of 
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the ranked Wright-Fisher process X implies that consistency condition (iii) of 
Proposition [TT] holds with T% = s^-i — and the conclusion of that proposition 
is that a O-feasible process satisfying ( 16. ip exists. 

6.1. Distributions associated with the 0- Wright-Fisher process. 

Problem 1. What are the distributions of Nj, and D a b for the 0- Wright-Fisher 

process? 

We remark that, if one only wanted to compute var(iVb), it would be sufficient 
to determine the limiting behavior of the quantity 

(6.2) P(supAfi(t) > b, supM 2 (t) > b\ Mi(0) = x, M 2 (0) = y) 

t t 

in the limit x, y I 0, where Mi, M 2 are the first two components of a 3-allele 
Wright-Fisher diffusion. We also note that the quantity (16. 2p coincides with 
the classical solution of the PDE \x{l — x)f xx + \y{l — y)f yy — xy f xy = on 
[0,6] x [0,b] with the boundary conditions f(x, 0) = f(0,y) = 0, f(x,b) = x/b, 
f{b,y) = y/b, provided that such a solution exists. We were not able to solve 
the PDE explicitly, so that even the question of finding var(A r ^) is an open 
problem. 

7. Final remarks and open problems 

We have already stated open problem [1] and Conjecture [TJ The discussion 
at the start of section 14.21 concerning constructions where D a b has small spread 
suggests the following closely analogous question concerning Brownian motions. 

Problem 2. For each 1 < % < k let (Bi(t), < t) be standard Brownian motion 
w.r.t. the same filtration, killed upon first hitting —1, and let Lj be the total 
local time of -Bj(-) at 0. How small can the ratio var[J^ =1 LJ/ var[Li] be? 

We do not know any relevant work, though Jim Pitman (personal commu- 
nication) observes that for k = 2 one can indeed have negative correlation 
between Li and L 2 . 
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